AD-A014  608 


SPEECH  UNDERSTANDING  SYSTEMS 
William  A.  Woods,  et  al 

Bolt  Beranek  and  Newman,  Incorporated 


Prepared  for: 

Office  of  Naval  Research 
Advanced  Research  Projects  Agency 


August  1975 


DISTRIBUTED  BY: 


Hatioiial  Technical  Infonnation  Service 
U.  S.  DEPARTMENT  OF  COMMERCE 


Unclassified 


StCOWTr  CLAttIfICATION  or  TmiI  page  nwi*"  *>•••  BnlmttO) 


REP0R1  DOCUMENTATION  PAGE 

READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM . 

«.  RCROnT  number 

BBN  Report  No.  3115 

1.  RECIPIENT’S  catalog  NUMBER 

4.  TITLE  (tn^  SuMlIt) 

SPEECH  UNDERSTANDING  SYSTEMS 

Quarterly  Technical  Progress  Report  nq,  3 

1 May  1975  to  1 August  1975 

S.  TYPE  OF  REPORT  6 PERIOD  COVERED 

Quarterly  Tech.  Prog.  Rep. 
1 May  1975  to  1 Aug.  1975 

6,  PERFORMING  ORG.  REPORT  NUMBER 

BBN  Report  No.  3115 

7.  AUTHURf*; 

William  A.  Wood.s,  Richard  M.  Schwartz,  Craig  C. 
Cook,  John  W.  Klovstad,  Lyn  A.  Bates,  Bonnie 
Nash-Webber,  Bertram  C.  Bruce,  John  I.  Makuoul 

e.  CONTRACT  OR  GRANT  NUMBFRC*, 

N00014-75-C-0533 

«.  PEF'  ORMING  organization  NAME  ANO  AOORESS 

Bolt  Beranek  and  Newman  Inc. 

50  Moulton  Street 
Cambridge.  MA  02138 

10.  PROGRAM  ELEMENT.  PROJECT.  TASK 
AREA  6 WORK  UNIT  NUMBERS 

5D30 

II.  CONTROLLING  OFFICE  NAME  ANO  AOORESS 

ONR 

Department  of  the  Navy 
Arllneton.  VA  22217 

IZ.  REPORT  DATE 

August  1975 

IS.  NUMBER  OF  PAGES 

50 

14  monitoring  AGENCY  NAME  » AOORESSCIf  rfiff.rant  Iram  Conitollind  Ollleo.' 

IS.  security  CLASS,  (of  !/!/•  f«port; 

unclassified 

is«.  declassification/ downgrading 

SCHEDULE 

16.  distribution  statement  Cal  iM*  R«p<>rf| 

Distribution  of  this  document  Is  unlimited.  It  .may  be  released  to 
the  Clearinghouse,  Department  of  Commerce  for  sale  to  the  general 
public . 

17.  OISTPieuTION  STATEMENT  (al  >h«  •b»lr»c.  mnl»r0^  In  Block  20.  II  dllhimt  Iram  Htporl) 


IS  supplementary  notes 


t9  KEY  WOFtDS  (Corttnu*  on  fev«r«9  «id«  tt  tfC99mry  «r»rf  tdmntHy  ty  bl’*ck 

Acoustic-phonetics,  acoustics,  acoustic  seg.ncntation,  augmented  transition 
network,  constituent  boundaries,  data  base,  dip  detector,  formant  tracking, 
formant  smoothing,  fundamental  frequency  contours,  parsing,  partial  matches 
phonological  rules,  property  checking,  pragmatics,  prosodies,  retrieval, 
semantic  networks,  sema p; Ics.  speech  recognition,  speech  understanding. 

20.  ABSTRACT  (Conllnuw  on  MocA  numbmr} 

This  report  covers  research  and  development  work  done  from  1 May  1975  to 
1 August  1975  under  the  Speech  Understanding  Systems  Contract  No.  N00014- 
75-C-0533.  Areas  included  in  this  work  are  acoustic-phonetics,  lexical 
retrieval,  lexical  verification,  and  natural  language  syntax.,  semantics, 
prosodies,  and  pragmatics.  The  report  consists  of  two  parts  — a brief 
survey  of  progress  containing  a ew  paragraphs  describing  the  major 
progress  in  the  individual  components  of  the  project,  and  a Technical  Notes 

DD  1473  ed.tionof  iNovESisoBfOLETE  I’uc laBs  1 f led 

t ECURITY  CLAlllFICATION  OF  THIl  PAGE  Bnlolod) 


\ 


Unclassified 


. ;|RITY  C L A;.-., I u A I ION  OF  THi5  PAGE  (Bftiii  Ihuat.nu  ri  ill 

19.  Key  Words  - cont'd. 

speaker  normalization,  syntax  analysis,  synthesis,  synthesis-by-ruie , 
vocal  tract  length. 

20.  Abstract  - cont'd. 

section  containing  detailed  specifications  of  experiments  performed, 
programs  implemented,  design  studies,  and,  where  appropriate,  supporting 
data  and  appendices.  This  third  QTPR  contains  such  technical  notes  on 
handling  of  time  expressions,  procedural  semantics  for  the  t’^avel  budget 
management  system,  and  control  primitives  for  speech  understanding 
strategies . 


Unclassified 


SE  CURIT  Y Cl  AiSlEIC  AT  .ON  OF  f hiS  P AO  F H “i  m /i.mi  / n . i '. 


ADA014608 


TIITTOO 

BOIT  BERANEK  AND  NEWMAN  ixc 

CONSUITING  • rfVClORMENT  • ICSEARCH 


BBN  Report  No.  3115 
A. I.  Report  No.  34 


( 


SPEECH  UNDERSTANDING  SYSTEMS 

Quarterly  Technical  Progress  Report  No.  3 
1 May  1975  to  1 August  1975 


by 

NATIONAL  TECHNICAL 

ineormation  service 


US  o<  CowMffE.* 

Spn  VA  22ISI 


Sponsored  by 

Advanced  Research  Projects  Agency 
ARPA  Order  No.  29C4 


This  research  was  supported  by  the  Advanced  Research  Projects 
Agency  of  the  Department  of  Defense  and  was  monitored  by  ONR 
under  Contiac':  No.  N00014-75-C-0533 . 

The  views  and  conclusions  contained  in  this  document  are  these 
of  the  authors  and  should  not  be  interpreted  as  necessarily 
representing  the  official  policies,  either  expressed  or  implied, 
of  the  Advanced  Research  Projects  Agency  or  the  U.S.  Government. 


Approval  for  public 
Dlstributkni  Uallialt*d 


Cambridge  Washington,  . c CHICAGO  Houston  tos  anceies  san  francisc 


SPEECH  UNDERSTANDING  SYSTEMS 


Quarterly  Technical  Progress  Report  No.  3 
1 May  1975  to  1 August  1975 


ARPA  Order  No.  290^4 
Propram  Code  No.  5D30 


Name  of  Contractor: 

Bolt  Beranek  and  Newman  Inc. 

Effective  Date  of  Contract: 

30  October  197^ 

Contract  Expiration  Date: 

29  October  1975 


Contract  No.  NOOO 1 4-75-C-0533 

Principal  Investieiator : 

William  A . Woods 
(617)  491-1850  x361 

Scientific  Officer: 

T.  H.  Lautenschlager 

Title: 

SPEECH  UNDERSTANDING  SYSTEMS 

QTPH  Editor: 

Bonnie  Nash-Webber 
(617)  491-1850  x227 


Amount  of  Contract:  $1,041,261 


Sponsored  by 

Aavanced  Research  Projects  Agency 
ARPA  Order  Ho,  2904 


This  research  was  supported  by  the  Advanced  Research 
Projects  Agencv  of  the  Department  of  Defense  and  was 
monitored  by  ONR  under  Contract  No.  NOOO 1 4-75-C-05 33 • 


Bolt  Beranek  and  Newman  Inc. 


BBN  Report  No.  3115 


Table  of  Contents 

pane 

I.  PROGRESS  OVERVIEWS 

A . Acoust ic  Analysis  1 

B . Acoust ic- Phone tics  1 

C.  Lexical  Retrieval  4 

D.  Dictionary  Expansion  5 

E.  Verification  7 

E . Syntax 9 

G . Semantics 9 

H User  4 Discourse  Model  11 

I . Travel  Budnet  Manager 's  Assistant 12 

J Control 13 


Re  ferences 14 


II.  TECHNICAL  MOTES 

A.  Time  Expressions 15 

B.  Procedural  Semantics  in  the  Travel  System  . . 27 

1 . The  Command  Language 27 

2.  Inference  Done  in  the  Course  of  Retrieval  30 

3.  Coordinating  Execution  with 

the  Discourse  Mvde]  33 

C.  Control  Primitives  37 


BBH  Report  No.  3115 


Bolt  Beranek  ano  Newman  Inc 


I.  PROGRESS  OVERVIEWS 
A . Acoustic  Analysis 

One  of  the  main  problems  in  the  accurate  estimation  of 
formants  and  signal  energy  is  the  variability  in  the  pitch 
of  an  individual  speaker  as  well  as  its  variability  across 
speakers.  The  autocorrelatior,  method  of  linear  prediction, 
which  wt  have  been  using  sc  far,  has  the  disadvantage  that 
it  is  sensitive  to  wide  variations  in  pitch,  due  to  the 
interaction  between  the  analysis  window  and  the  pitch 
period.  The  covariance  method  does  not  use  a window  and 
hence  does  not  exhibit  the  same  degree  of  sensitivity  to 
pitch  variations.  However,  it  has  the  disadvantage  that  the 
stability  of  the  computed  model  is  not  assured.  We  are  now 
working  on  a class  of  methods  (due  primarily  to  Itakura  and 
Burg)  which  do  not  require  windowing  and  yet  do  preserve 
stability.  We  hope  to  settle  on  one  method  which  will  prove 
optimal  for  speech  analysis. 


B.  Acoustic- Phonetic  Segmentation  and  Labeling 

This  quarter  we  extended  the  first-pass  segmentation 
process  described  in  the  last  quarterly  progress  report 
[Woods  et  1975bl  to  the  point  where  it  produces  segment 
lattices  v/hich  are  suitable  for  input  to  the  word  matcher. 
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In  this  process,  the  APR  component  starts  by  applying 
I dip  detection  routines  to  three  different  energy  parameters 

: to  produce  three  sets  of  boundaries  of  different  types. 

Dips  in  the  parameter  LEZ  (smoothed  low-frequency  energy 
; from  120-440  Hz)  indicate  likely  obstruents  or  obstruent 

sequences.  Dips  in  MEPZ  (smoothed  mid-frequency  energy  from 

the  preemphasized  signal  between  640-2800  Hz)  which  occur 

i 

I within  sonorant  sequences  indicate  nasals,  back  semivowels 

I [W,L],  flaps  or  intervocalic  obstruents  (e,g,,  [ HH , V , DH , D ] ) , 

[ 

i Dips  in  HEPZ  (smoothed  high-frequency  energy  from  the 

f 

■ preemphasized  signal  between  3400-5000  Hz)  that  occur  within 
sonorant  sequences  indicate  [R]  or  flaps  and  sometimes 
nasals,  [W]  or  intervocalic  obstruents.  Dips  in  HEPZ  within 
obstruent  seauences  indicate  silences  or  weak  fricatives, 

i 

I Merging  these  results  vieldr  a lattice  of  regions  of 

I'r 

! 

■ nine  different  types,  each  cc. rresponding  to  one  or  more 

\ 

j phonemes.  The  remainder  of  the  program  consists  of  rules 

i that  are  region-  and  context-specific.  One  typical  rule 

! 

• looks  at  regions  that  were  classified  as  intervocalic 

sonorants  or  glides,  and  by  looking  for  rapid  changes  in  the 
formants  (mainly  FI)  determines  whether  or  not  it  is  a 
nasal.  Another  rule  looks  at  an  obstruent  or  si.lence  region 
followed  by  a short  frication  region,  and  decides  whether 
the  frication  is  the  aspiration  from  an  unvoiced  plosive  or 
represents  a strident  fricative.  Within  vowel  regions,  the 
three  formants  are  each  described  in  terms  of  a series  of 


-2- 


BDN  Report  No.  3115 


Bolt  Beranek  and  Newnan  Inc 


canonical  shapes.  Based  on  these  representations,  formant 
extrema  or  plateaus  are  identified  as  possible  vowel 
targets.  Vowel  identity  is  determined  using  the  values  of 
the  three  formants  us  normalized  by  the  average  fundamental 
frequency  for  the  utterance  [Schwartz,  1971]  along  with 
durational  constraints.  Those  rules  that  are  optional  add 
branches  to  the  lattice,  while  others  make  a narrower 
specification  of  the  labels  cn  existing  segments. 

In  addition  to  the  vowels,  the  program  currently 
recognizes  individual  unvoiced  plosives  and  fricatives.  It 
also  uses  formant  transitions  to  classify  voiced  plosives 
and  nasals  in  a rough  v/ay.  Prevocalic  [W,R,L,Y]  are 
detected  and  identified  from  formant  transitions. 
Unreleased  plosive-plosive  pairs  are  detected  based  on  the 
duration  of  silences.  In  all,  the  program  uses  60  different 
symbols  to  label  the  segments  of  the  lattice. 

Using  the  Acoustic-Phonetic  Experiment  Facility  [Woods 
et  al.,  1975a,  pp,  20-5?;  Schwartz,  1975],  we  can  compare 
the  labels  in  the  hand-labeled  files  (correct)  with  those 
chosen  by  the  orocran,  in  order  to  create  a confusion  mati  ix 
v/hich  is  used  by  the  word  matcher  in  scoring  po;  sible  v;ords. 
The  performance  of  the  small  set  of  algorithms  is  very 
encouraging,  producing  lattices  with  small  branching  ratios 
and  relatively  few  errors.  More  work  will  be  needed  to 
improve  the  specificity  and  accuracy  of  the  segment  labels. 
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We  will  also  be  spending  more  effort  on  devising  algorithms 
to  correctly  segment  sonorant  sequences. 


C,  Lexical  Retrieval  Component 


In  the  past  Quarter,  work  was  done  on  four  areas  of 
lexical  retrieval  relating  to: 


1)  Generation  of  appropriate  input  for  the  tree, 
compiler , 


2)  Modification  of  the  tree  compiler, 

3)  Extension  of  the  Matcher's  capabilities. 


M)  Use  of  APR  statistics  for  segment  lattice 
generation , 


On  the  first  point,  the  tree  compiler  requires  as  input 
a BCPL-readable  version  of  the  expanded  dictionary  (see 
Section  1,0,)  in  order  to  build  an  appropriate  tree 
structure  for  the  Matcher,  To  this  end,  LISP  programs  were 
written  to  produce  a BCPL-readable  text  file  with  the 
appropriate  information. 


Secondly,  we  extended  the  BCPL  program  which  reads  this 
text  file  and  creates  the  appropriate  tree  structure  to 
recognize  and  encode  certain  inflectional  endings,  to 
associate  a probability  with  every  pronunciation,  and  to 
construct  two  separate  tree  structures,  one  for  scanning 
left-to-right , the  other  right-to-left. 
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Thirdly,  we  extended  the  Matcher  to  use  the  appropriate 
tree  structure  for  a piven  scan.  As  a result,  phonological 
context  can  now  be  taken  into  account  when  scanning  to  the 
right  of,  left  of,  or  between  groups  of  specified  word 
matches.  Furthermore  the  set  of  words  sought  can  be 
specified  by  explicate  enumeration,  membership  in  some 
designated  class,  by  phonetic  length,  or  by  any  combination 
of  these  three.  As  a further  extension  to  the  Matcher,  a 
special  control  language  was  designed  for  efficient 
interfork  communication  with  the  LISP  world.  The  language 
has  been  implemented  as  a BCPL  program  which  reads  LISP 
generated  commands  and  creates  LISP  readable  output. 

Finally,  programs  have  been  written  to  collect 
statistics  as  well  as  to  pad,  adjust,  and  normalize  them  in 
creating  a log  confusion  matrix.  This  matrix  is  then  used 
with  the  present  APR  output  to  create  segment  lattices  with 
probability  vectors  as  segment  descriptors.  Any 
improvements  resultin'^  from  the  modification  and  extension 
of  the  APR  component  can  now  be  ouickly  realized  and 
evaluated , 


Dictionary  Expansion 

During  the  past  quarter,  the  Bobrow-Fraser  rule 
compiler  was  extended  and  the  set  of  phonological  rules  used 
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for  dictionary  expansion  refined  to  a point  of  relative 
stability.  Changes  to  the  rule  compiler  included  the 
incorporation  of  likelihood  numbers  with  optional  rules 
indicating  the  relative  goodness  of  applying  the  rule  versus 
not  applying  it,  and  similar  numbers  associated  with  the 
alternative  base  form  pronunciations  of  v/ords  in  the 
dictionary.  These  numbers  are  multiplied  together  as  the 
words  are  expanded  so  that  each  expanded  form  carries  with 
it  a likelihood  number  which  is  the  product  of  the 
likelihood  associated  with  its  base  form,  the  likelihoods  of 
application  of  all  the  optional  rules  applied  to  it,  and  the 
likelihoods  of  not  applying  of  all  optional  rules  which 
matched  but  were  not  applied  to  it. 


An  additional  extension  to  the  rule  expansion  facility 
now  allows  one  to  associate  a predicate  with  a rule,  making 
the  applicability  of  the  rule  conditional  on  arbitrary 
features  of  the  word  to  which  it  is  being  applied.  (For 
example,  features  such  as  syntactic  part  of  speech,  length 
of  the  word,  and  geographical  or  foreign  origin  of  a word 
could  be  used  to  determine  the  applicabil it v of  a rule,  thus 
permitting  the  inclusion  of  rules  that  apply  only  to  special 
classes  of  words,  such  as  short  function  words,  words  of 
Latin  origin,  etc.) 
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The  current  set  or‘  rules  that  is  being  used  to  expand 
the  dictionary  consists  of  48  rules  that  cover  regular 
inflections,  vowel  reduction,  consonant  syllabification, 
palat.al  izat  ion , stop  insertion  and  deletion,  as  well  as 
dentai  flapping. 


E,  Verification 

During  the  past  quai ter,  we  have  devoted  considerable 
effort  to  debugging  and  improving  our  synthesis-hy-rule 
program.  This  allows  us  to  synthe: ize  in  near  real-time  a 
parametric  representation  of  any  word,  criven  its  phonetic 
transcription-  The  additicn  of  a sophisticated  phonological 
component  to  the  program  has  greatly  improved  the  quality  of 
the  synthesis  output,  by  allowing  us  to  take  into  account 
phonological  effects  across  word  boundaries,  altering  the 
parameterization  according  to  the  context  in  vrhic'^  the 
hypothesized  word  n <y  occur.  In  addition,  it  nega  es  the 
need  to  store  separately  parameterization  for  eac  . entry 
in  the  I'^xicon, 


Time  norma.-  ization  is  .done  using  a dynamic  p -ogramming 
algi.rithm  bas  :>!  on  a method  first  developed  by 
Itakura  [197'i],  The  technique  involves  a non-linear  time 
warping  based  on  the  registration  of  the  error  metric ; in 
this  case,  the  ratio  of  the  linear  prediction  residuals.  We 
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have  modified  hi  method  to  allow  limited  misalignment  in 
ti.iie  between  the  hypothesized  word  parameterization  and  that 
portion  of  an  unknown  utterance  onto  which  we  wish  to  match 
it , 


In  actually  computin,;  the  'distance'  between  a 
hypothesized  word  and  a portion  of  the  unknown  utterance,  we 
sum  the  prediction  residuals  between  corresponding  segments 
of  the  two,  the  corresponder,  ce  having  already  been 
determined  by  the  time  normalization  technique.  Comparing 
the  linear  prediction  residuals  is  a method  of  spectral 
matching,  specifically  the  spectra  of  the  all-pole  models. 

At  this  time,  we  have  integrated  the  synthesis-by-rule 
program  with  the  tine  normalization  algorithm  and  the 
parametric  word  matcher  into  a single  component  which  runs 
interactively  from  a console.  Given  a phonetic  spelling  of 
a hyoothesized  word  torether  with  available  context,  the 
component  synthesizes  the  parameterization  of  the  word  for 
that  context  and  matches  it  onto  the  specific  region  of  the 
parameterization  o.’  the  unknown  u*'terance  specified  by  the 
user.  The  verificc  ion  component  returns  a score, 
normalized  v;ith  the  duration  of  the  hypothesized  word, 
indicating  the  likelihood  of  that  word  occurring  at  the 
given  position  in  the  utterance.  The  interactive 
implementation  operates  presently  in  3-^  times  real  time. 
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F.  Syntax 

This  quarter  saw  the  completion  of  a major  report  on 
the  syntactic  component  of  the  BBN  speech  understanding 
system,  which  is  to  be  printed  in  early  fall. 

With  respect  to  the  grammar,  we  began  to  design  a 
sub-grammar  for  adverbial  time  modifiers  (see  Section 
II.A,),  An  additional  change  to  the  grammar  was  the 
addition  of  an  expanded  proper  noun  network  to  parse 
people's  names  (first  or  first  and  last)  and  plac^  names 
(e.g.,  "Pittsburgh,  Pennsylvania,"  "Paris,  France,"  etc.). 
With  respect  to  the  parser,  two  flags  were  added  so  that  a 
human  simulator  (or  control  program)  has  available  options 
to  force  the  following  of  all  parse  paths  instead  of  just 
the  current  best  ones,  and  to  cause  proposals  to  be  made  for 
all  monitored  syntactic  classes  rather  than  those  classes 
with  a small  number  of  members. 


G,  Semantics 

In  the  past  quarter,  we  have  brought  up  a new  version 
of  the  semantic  network  package  (SEMNET)  which  supports  the 
maintaining  of  a semantic  network  on  an  external  disk  file 
instead  of  in-oore.  This  has  several  advantages  for  us, 
including  lower  in-core  storage  requirements,  increased 
filing  speed,  and  better  control  over  multiple-user  access. 
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The  primary  impetus  for  this  new  implementation  was  the 
growing  size  of  the  SPEECHLIS  semantic  network,  whicn  is 
being  used  to  store  all  our  semantic,  pragmatic  and  dat? 
base  information  about  the  travel  budget  management  domain. 
At  this  writing,  the  number  of  nodet.  in  the  common  network 
has  reached  875,  with  148H  two-wa.  ’nks  and  2379  one-way 
links.  This  is  a large  drain  on  storage,  and  we  believe 
that  not  all  these  nodes  will  ever  be  accessed  in  any  one 
session  by  any  one  knowledge  source.  The  cumbersomeness  of 
merging  networks  (cf,,  MERGESEMNET  in  QTPR  2),  after  several 
users'  simultaneous  changes  have  resulted  in  several 
slightly  different  versions  of  the  network,  was  another 
reason  fo*’  desiring  a new  implementation,  one  which  would 
make  impossible  simultaneous  chances  to  the  network. 


In  the  new  implementation,  only  a few  things  about  the 
network  are  initially  loaded  into  core: 


1,  The  set  of  terms  and  each  one's  SREF  property  whose 
value  is  the  semantic  network  node  to  which  the  term 
refers, 

2,  The  semantic  network  array,  containing  not  the  set  of 
links  and  properties  associated  with  each  node,  but 
rather  pointers  into  a separate  file  (the  ’’guts”  file) 
in  which  this  information  is  stored, 

3,  The  set  of  global  variables  used  in  network 

manipulation;  e,g,,  a pointer  to  the  free  list 

(FREELIST),  a pointer  to  the  highest  network  array  cell 
thus  far  used  (NITEMS),  the  name  of  the  guts  file,  the 
size  of  the  network  array,  etc.  When  a node  1 
accessed,  its  corresponding  array  cell  is  checked,  1 
it  still  contains  a pair  of  file  pointers,  the  link 
information  about  the  node  is  read  in  from  the  guts 
file  before  processing  continues  as  usual.  If  the 
network  is  edited  and  refiled,  information  about  nodes 
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that  have  not  been  loaded  in-core  is  copied  quickly  and 
directly  from  the  previous  guts  file. 

One  has  the  option  of  opening  the  guts  file  either 
"thawed"  or  protected.  If  one  chooses  the  latter,  ono  can 
prevent  simultaneous  changes  to  the  network  by  more  than  one 
user.  If  one  chooses  the  former,  however,  one  can  thereby 
allow  run-time  access  to  the  net  by  several  different 
processes. 


H,  User  and  Discourse  Model 

An  augmented  transition  network  (ATN)  grammar  is  being 
used  to  represent  some  of  the  common  modes  of  interaction 
found  in  travel  budget  management  dialogues,  A modified  ATN 
parser  has  been  written  that  steps  through  this  grammar  on 
the  basis  of  the  input  sentence  structure  and  the 
then-current  state  of  the  data  base.  At  any  given  state  the 
parser  can  predict  the  most  likely  next  intent  and  hence 
such  things  as  the  mood  and  import  of  the  next  utterance. 

We  are  also  exploring  the  use  of  a more  flexible 
discourse  model  that  may  partially  or  wholly  supplant  the 
current  ATN  model.  This  latt  • model  is  based  on  the  no*^  n 
of  a set  of  pending  "der.iands"  and  "counter-demands,"  (A 
sketch  of  this  model  is  presented  in  Section  II, B, 3.) 
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The  Travel  Budget  Manager 's  Assistant 

The  current  task  domain  of  the  BBN  speech  project  is 

that  of  assisting  a travel  budget  manager.  It  is  meant  to 

help  the  manager  keep  a record  of  trips  taken  or  proposed 
and  to  produce  summary  information  such  as  the  total  money 

allocated.  It  is  a simplified  example  of  many  other 

resource  management  problems  of  essentially  the  same  type 
and  is  an  initial  step  toward  an  intelligent  manager's 
assistant.  The  data  base  management  facilities  of  the 
systt  are  accessed  through  a formal  command  language  (see 
II.B.),  into  which  spoken  requests  will  be  translated.  The 
command  language  operates  on  a set  of  data  base  structures 
representing  such  things  as  trips,  contracts,  budgets, 
conferences,  dates,  fares,  and  lengths  cf  time.  The 
structures  for  times  are  discussed  in  II, A, 

In  the  past  quarter,  we  have  developed  a preliminary 
set  of  programs  to  allow  the  travel  budget  manager's 
assistant  to  respond  to  tne  manager  in  an  English-like 
language.  For  example,  it  may  describe  a trip  as: 

John  Makhoul  is  going  to  Pittsburgh  from  Monday, 

the  30th  of  June  to  Wednesday,  the  2nd  of  July, 

1975. 

We  have  also  been  running  simulations  to  develop  and 
circumscribe  the  travel  budget  manager's  assistant.  In  the 
simulations,  one  person,  sitting  at  terminal  A,  plays  the 
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role  of  the  travel  bud«^et  manager,  typing  in  sentences  ->s  if 
he  were  talking  to  a complete  travel  budget  mana.,cr‘s 
assistrnt.  Another  person,  at  terminal  B,  intercepts  his 
sentences  and  translates  them  into  the  formal  command 
language  mentioned  above  and  passes  the  translations  to  the 
retrieval  component  for  execution.  These  simulations  also 
provide  a source  for  dialog  protocols  and  new  words  that 
should  be  included  in  the  lexicon. 


J.  Control 

During  the  past  quarter,  we  cont?nuec  our  v;ork  in 
developing  specific  control  strategies  that  would  take: 
suitable  advantage  of  the  dilferent  capabilities  of  the 
individual  high  level  components.  For  example,  our 
increased  confidence  in  the  results  of  lexical  retrieval, 
brought  about  by  great  improvements  in  the  rratch  component, 
is  leading  to  strategies  which  rely  more  on  that  component 
in  assembling  and  evaluating  theories.  We  are  working  in  a 
mode  of  incremental  simulation  in  order  to  recognize  and 
develop  these  strategies,  and  to  ease  the  task,  a set  of 
"primitive”  control  operations  have  been  isolated  and 
implemented,  which  are  discussed  in  detail  in  Section  II, C, 
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II.  TECHNICAL  NOTES 

A.  Time  EAUresslons 

Lyn  Bates 

Bertram  C,  Bruce 

References  to  dates  and  time  are  frequent  in  the  travel 
budget  management  domain.  The  manager  needs  to  know  what 
trips  are  scheduled  for  the  current  budget  period,  how  long 
a given  trip  is  (and  therefore,  how  expensive),  and  what 
conflicts  there  may  be  among  conference  dates  or  planned 
trips.  In  order  to  understand  these  time  expressions  and  to 
process  them  correctly,  we  have  written  a set  of  programs 
which  (1)  oarse  time  and  date  expressions,  (2)  convert  the 
parse  structures  into  structures  well  suited  for  inference 
and  retrieval,  (3)  calculate  ordering  relations  among 
(perhaps  incompletely  specified)  time  points,  (4)  calculate 
lengths  of  time  from  (perhaps  incompletely  specified)  time 
periods,  and  (5)  generate  English  descriptions  of  the  time 
information.  This  cechnical  note  is  a discussion  of  the 
scope  of  the  problem  we  are  norking  on,  the  time/date 
grammar,  the  parse  structures  and  the  data  base 

representation  for  the  time  information.  Thus  it  is 
basically  a presentation  of  ciie  above  points  (1)  and  (2). 

Before  discussing  the  details  of  representation,  we 
should  poit.t  out  what  is  not  being  attempted.  We  are  not 
trying  to  handle  every  conceivable  expression  which  has  any 
temporal  import.  We  do  not  expect  the  mechanisms  discussed 
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here  to  process  "the  driest  season  in  the  last  ten  years," 
"the  last  week  of  our  most  recent  contract  year,"  or  even 
"the  week  of  April  10." 

Instead  of  attempting  total  generality,  we  are 
deliberately  isolating  a rather  large  subset  of  English  time 
phrases  which  can  be  processed  in  a somewhat  isolated  and 
efficient  manner  without  recojrse  to  extensive  semantic 
analysis.  Thus  phrases  like: 

Late  last  week  John  left  for  Chicago. 

We  spent  money  during  July. 

Will  Bill  go  to  Washington  ijn  April? 

How  much  did  we  spend  this  last  quarter? 

He  is  going  late  in  the  fall. 

Lyn  is  going  to  Colorado  on,  August  fifteenth. 

can  be  parsed  independently  of  the  rest  of  the  parsing  and 
packaged  for  the  data  base  without  the  usual  semantic 
processing.  Besides  being  efficient,  this  allows  us  to 
concentrate  on  other  syntactic-semantic  problems,  and  does 
not  preclude  handling  a more  general  class  of  time 
expressions  in  the  normal  way. 

The  augmented  transition  network  (ATN)  for  the  time 
grammar  is  shown  in  P’igure  1.  In  the  figure,  WEEKDAY  and 
MONTH  are  syntactic  categories.  ORD/  and  NUMBER/  are  ATN's 
themselves  that  are  not  shown  which  recognize  ordinal  and 
cardinal  numbers  respectively.  The  tests  on  the  arcs 
preclude  expressions  like  "Thursday  the  five,"  "thirty 
January  fifteenth,"  and  "the  ten  of  June." 
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Syntax  uses  the  time  prammar  of  Figure  1 to  build  a 
special  parse  structure  which  is  pererally  a substructure  of 
that  shown  in  Figure  2,  This  structure  leaves  out  function 
words  within  the  time  phrases  whose  meaning  is  only  to 
determine  which  structure  is  to  be  built#  The  whole  time 
parse  structure  can  serve  as  the  object  of  a preposition,  as 
an  adverb,  or  as  an  adjective. 

We  currently  allow  at  most  one  ordinal  and  one 
adjective  for  the  phrase  as  a v;hole,  e#5#,  "the  last  of  the 
year,"  There  may  also  be  an  ordinal  and  adjective  on  each 
subunit  of  the  structure#  For  each  subunit,  e#g#  MONTH, 
there  may  or  may  not  be  a third  link  pointing  to  the  value. 
For  example,  "next  May"  has  the  structure# 

(TIME  (MONTH  (ORD  NEXT)  MAY)) 
whereas  "next  month"  is  simply, 

(TIME  (MONTH  (ORD  NEXT))) 

Some  representative  phrases  with  their  parse  structures  are 
shovm  in  Figure  3# 

Once  the  parse  structure  is  completed,  it  becomes  a 
sort  of  "black  box"  marked  as  a tine  expression.  That  is. 
Semantics  does  not  need  to  analyze  it  nor  bother  with 
connections  between  elements  outside  the  time  expression  and 
subunits  of  the  time  expression#  Instead,  a data  base 
function,  TIMEBUILD,  takes  the  parsed  time  expression 

directly  and  builds  the  appropriate  data  base  structures, 
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^CAT^WE^AY 
"'PUSH  NUMBER/ 


Figure  1.  The  ATN  grammar  fragment  for  time  expressions. 
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Figure  2.  Potential  structures  built  by  the  parser 
for  time  expressions. 
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THE  LAST  DAY  OF  THE  QUARTER 


WEEKDAY  QUARTER 

ORD  DAY 

LAST 


FRIDAY  THE  10^^  OF  JUNE 
TIME 


3Y  M 


WEEKDAY  I MONTH 
I DfY  I 
FRIDAY  10  JUNE 


LATE  LAST  SPRING, LATE  IN  LAST  SPRING 
TIME 

ADJ  SEASON 

! i 1 

LATE  op  SPRING 
LAST 


EARLY  iN  THE  FIRST  QUARTER  OF  THE  FISCAL  YEAR 

TIME 


EARLY  Oi 
FI 


Ap  QUARTER  YEAR 
RD  ADJ 


RST  FISCAL 


Figure  3.  Examples  of  parse  structures 
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For  example,  the  phrase,  "on  a Tuesday  in  June,  1975" 
is  parsed  into  the  structure, 

(TIME  (WEEEHAY  TUESDAY )( MONTH  JUNE) (YEAR  1975)) 

TIMEBUILD  user  this  parse  structure  to  build  a data  base 
structure  such  as  shown  in  Figure  A,  in  order  to  make  the 
data  base  structure,  TIMEBUILD  must  consiorr  the  ordinals 
and  adjectives  and  perform  appropriate  transi  o'^nations  on 


the  values  for  each  subunit 
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Figure  Structure  of  a time  point  for 

"on  3 Tuesday  in  June,  1975." 
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The  current  version  of  TIMEBUILD  processes  the  MONTH 
portion  of  a TIME  parse  structure  as  follows:  If  there  is  no 
ORD  (ordinal)  link  then  the  MONTH  number  for  the  month 
(e.g.,  August  ->  8)  is  stored  as  is.  If  there  is  no  month 
value,  as  in  "this  month",  then  the  ORD  link  is  used  to 
calculate  the  appropriate  MONTH  and  YEAR  from  the 
corresponding  values  on  NOW,  where  NOW  is  a globally 
accessible  tim"=>  point  which  represents  the  current  time. 
Note  that  for  time  expression  we  are  treating  "this," 
"next,"  and  "last"  as  ordinals.  If  there  is  both  an  ORD 
link  and  a month  value,  then  each  type  oi  ordinal  has  its 
own  interpretation.  For  instance,  "next"  is  interpreted  as 
the  first  future  occurrence  of  the  specified  month,  e.g.,  if 
NOW  is  June,  1975,  then  '-next  August"  means  August,  1975  and 
"next  May"  means  May,  1976.  Other  portions  of  the  TIME 
parse  structure  are  processed  in  a similar  way.  The  current 
interpretations  are  only  approximate  and  need  to  be 
buttressed  by  a consideration  of  tense,  topic  and  discourse 
structure. 

In  addition  to  time  points,  the  data  base  has 
representations  for  time  periods  and  lengths  of  time. 
Examples  of  these  are  shown  in  Figures  5 and  6.  The  time 
period  is  the  highest  level  time  structure  and  may  be  linked 
by  TIME/OF  to  some  event,  (In  Figure  5,  the  time  period  is 
associated  with  a trip).  The  time  period  has  a CPZA'^E/TIME 
which  is  the  time  the  node  was  added  to  the  data  base. 
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The  time  period  may  be  completely  or  partially 
specified.  For  example^  one  might  know  only  the  length  of  a 
trip  and  not  its  starting  time;  or  one  might  know  when  it 
starts  but  not  when  it  ends  (or  its  length).  The  inferenr 
routines  (see  Section  11,6,)  are  able  to  process  such 
information  in  various,  incomplete  forms  and  produce  results 
at  the  maximum  possible  information  level. 


Time  points  and  lengths  of  time  are  the  main  components 
of  a time  period.  Both  have  year,  month,  day,  hour  and 
minute  values.  In  addition  time  points  have  links  to  those 
time  points  which  they  precede  or  follow  (only  if  that  can 
not  be  determined  directly  from  their  values). 


V/e  plan  to  continue  work  on  the  time  grammar  in  several 
areas.  One  is  to  make  the  grammar  more  selective  especially 
about  prepositions.  For  example,  one  says 

late  on  the  first  Tuesday  iri  June 


bv’.t 


late  in  the  last  week  of  June, 


The  time  grammar  also  needs  to 
with  the  rest  of  the  travel  grammar, 
of  nominal  time  phrases  as  adverbials 
to  California  next  week, ) needs  t-^  be 


be  better  integrated 
In  particular  the  use 
(e,p,,  John  is  goitig 
v/orked  out  more  fully. 
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B V P^^ocedural  Semantics  in  the  Tra  /el  System 

Bertram  C,  Bruce 
Sregory  Harris 

1.  The  Command  Language 

A formal  command  language  for  interacting  with  the 
travel  budget  management  data  base  has  now  been  incorporated 
into  SPEECHLIS,  On  its  own  it  functions  as  a tool  to  build 
and  access  the  travel  data  base.  In  the  context  of 
processing  utterances,  it  will  serve  as  the  goal  language  of 
semantic  interpretation  applied  to  the  parse  trees  built  by 
Syntax  and  the  caseframes  built  by  Semantics. 

Sentences  of  the  command  language  consist  of 
expressions  built  out  of  operators  and  their  arguments.  The 
operators  specify  operations  to  be  performed  on  the  data 
base  or  interactions  with  the  user.  The  arguments  may 
either  refer  to  elements  of  the  semantic  network  or  be 
arbitrary  constant  expressions.  In  the  first  case,  the 
argument  may  be  specified  by  its  print  name,  its  index  in 
the  network,  or  by  a LISP  expression  to  be  evaluated.  Some 
examples  of  English  sentences  and  their  expression  in  the 
command  language  are  given  belov/: 

(a)  Bill  is  going  to  Chica.  o on  !1aroh  15th. 

(BUILD:  DB/TRIP 

(TRAVELER  (FIND:  PERSON  (FIRSTNAME  'RILL))) 

(DESTINATION  'CHICAGO) 

(BEGIN/TII-iE  '(TI'1E(H0MTH  MARCH)(DAY  15)))) 
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(b)  Estimate  a cost  for  his  trip. 

(FOH:  THE  XI  / (FIND:  DB/TRIP 

(TRAVELER  (FIND:  PERSON  (GENDER  MALE)))) 
: T ; (GENERATE:  (COMPUTE:  XI  'COST))) 


(c)  Print  out  all  scheduled  trips. 

(FOR:  EVERY  XI  / (FIND:  DB/TRIP  (MODALITY  'SCHEDULED)) 

: T ; (GENERATE:  XI)) 


(d)  When  is  Lyn  going  to  London? 

(FOR:  THE  XI  /(FIND;  DB/TRIP 

(TRAVELER  (FINE,:  PERSON  (FIRSTNAME  'LYN)))) 
(DESTINATION  'LONDON)) 

: (AFTER?  (GET:  XI  'TIME)  NOW) 

; (GENERATE:  (GET:  XI  'TIME))) 


(e)  How  lonrr  is  Bill's  trip  to  Chicago? 

(FOR:  THE  XI  / (FIND:  DB/TRIP 

(TRAVELER  (FIND:  PERSON  (FIRSTNAME  'BILL))) 
(DESTINATION  'CHICAGO)) 

: (AFTER?  (GET:  XI  TIME)  NOW) 

; (GENERATE:  (GET:  XI  '//DAYS))) 


(f)  When  is  the  first  trip  Chip  is  taking  next  year? 

(FOR:  (ORD  1)  XI  / (FIND:  DB/TRIP 

(TRAVELER  (FIND:  PERSON  (FIRSTNAME  'CHIP)))) 
: (DURING  (GET:  XI  'BEGIN/TIHE) 

(TII1EDUILD  '((ORD  NEXT)  (YEAR) ))  ) 

; (GENERATE:  XI)) 


The  present  implementation  of  some  of  the  functions 
used  in  the  command  language  is  described  below; 


(ADD:  node  link  value) 

Adds  value  to  node  under  the  attribute  link.  If  link 
has  an  ADDFN  property  associated  with  it,  then  the 
value  of  that  property  is  a procedure  which  is  executed 
to  add  value . Otherwise,  SEMNET  primitives  are  used  to 
moke  the  appropriate  relational  or  property 
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connections, 

(COHPUTS:  node  link) 

A command  to  compute,  as  opposed  to  just  find,  a value 
for  node  and  link;  equivalent  to  (GET:  node  link  ? 
DEFAULT).  (See  below.) 

(GET:  node  link  value  flap) 

If  value  is  NIL  or  ? then  GET:  follows  link  from  node 
and  returns  value.  Otherwise  it  verifies  (or  denies) 
that  the  specified  value  is  stored.  Flap  determines 
the  extent  of  search  if  the  value  is  not  stored 
explicitly.  Currently,  if  flag  is  T then  no  search  is 
done.  If  it  is  DEFAULT,  then  inferences  are  done  as 
determined  by  METHODS  associated  with  the  link  name 
(see  II. B. 2).  If  no  METHOD  succeeds,  then  the  speaker 
is  asked  for  helu.  If  flag  is  NIL  then  the  speaker  is 
attain  consulted, 

(BUILD:  item-tvpe  (linki  valuel)  (link2  value2)  ,..) 

Builds  an  item  which  is  an  instance  of  item-tvpe  and 
has  the  specified  link-value  pairs.  Uses  ADD:  for  each 
pair  and  also  adds  D6/CRHAT0R  and  CREATE/TIME  links, 

(FIND:  item-type  (linki  valuel)  (link2  value2)  ,,,) 

Finds  an  item  which  is  an  instance  of  item-tvpe  and  has 
the  specified  link  value  pairs.  Uses  GET;  for  each 
pair,  FIND:  is  an  enumeration  function  which  can  be 
used  with  FOR:  (see  beloif)  co  produce  elements  one  at  a 
t ime . 

(FOR:  quantifier  variable  / clajs  : restriction  ; command) 

Applies  command  to  elements  of  class  for  which 
restriction  holds  and  as  determined  by  quantifier. 
Variable  is  bound  to  elements  of  the  restricted  class 
and  is  a free  variable  in  command . (The  permissible 
values  for  quant  if ier  have  been  generalized  from  those 
in  LUNAR  [L’oods,  W.  A.,  R,  M,  Kaplan  and 

H,  Nash-Webber,  1Q?2]  system  to  allow  specification  of 
'Cardinal  s by  (THE  <numher>).  Also,  FOR:  now 

distinguishes  the  universal  quantifier,  EACH,  which 
requires  that  at  least  one  item  belong  to  class  from 
its  counterpart,  everv  which  does  not.) 

(COMPLETE:  item) 

Special  command  v/hich  searches  through  item  description 
and  asks  for  missing  values.  It  stops  when  the 
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description  is  complete  or  when  the  user  says  "stop". 
COMPLETE:  also  allows  the  usei’  to  say  "unknown”  t'  any 
question. 

(GENERATE:  arg) 

Examines  arg  to  determine  appropriate  form  of  printout. 
If  arg  is  an  item  whose  item-type  has  a PRINTFN  then 
that  procedure  is  used  to  print.  If  there  is  no 
PRINTFN  then  SEMNET  printing  primitives  are  used. 
GENERATE:  also  prints  strings  and  lists. 

2.  Inference  Done  in  the  Course  of  Retrieval 

Inference  in  this  system  can  be  viewed  as  a natural 
ffeneralization  of  the  notion  of  structures  with  slots  and 
default  values  for  each  slot.  Here,  instead  of  being 
values,  defaults  are  procedures  for  determining  the 
appropriate  value  whenever  a slot  filler  is  missing.  These 
procedures  may  require  the  values  of  other  slots,  which  in 
turn  nay  require  activatin?  other  default  procedures.  The 
i iference  process  also  contains  an  advice-passinp  mechanism 
that  gives  it  a modicum  of  control  and  an  ultimate  default, 
which  is  to  ask  the  speaker. 

The  current  inference  process  is  implemented  via  the 
function,  GET:.  GET:  can  be  used  to  find  the  value  for  a 
node-link  pair  or  to  verify  that  a specified  value  is  there. 
A flag  can  be  set  that  determines  the  depth  of  search  and 
whether  or  not  the  speaker  is  to  be  asked  in  the  event  of 
failure.  The  effect  of  the  GET:  implementation  is  that  the 
basic  operation  of  requesting  the  value  of  an  attribute  of 
an  object  is  not  conditioned  by  (perhaps  arbitrary)  data 
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base  constructions.  It  also  means  that  whereas  a nev; 
attribute  must  be  structurally  defined,  there  does  not  need 
to  be  a special  set  of  functions  for  retrieving  its  value 
under  an  indefinitely  lar^e  assortment  of  situations, 

For  example,  the  call  (GET:  <trip>  'COST)  is  produced 
as  part  of  the  interpretation  of  "the  cost  for  that  trip," 
If  "cost"  were  stored  explicitly,  then  no  deduction  would  be 
required.  If  not,  a cascade  of  calls  to  GET:  can  result, 
based  on  the  default  functions  for  "cost"  (see  Figure  7), 
Advice  can  be  passed  from  hifrher  to  lovjer  level  calls  to 
"uide  and  constrain  the  inference  process. 

The  recursive  mechanism  of  GET:  is  driven  by  the 
pronerty,  METHODS  on  the  link  name  specified  in  the  call  to 
GET:.  Each  METHOD  consists  of  an  APPLICABILIT7/TE3T  which 
restricts  the  application  of  the  method,  a FUNCTION  naminm 
an  operation  to  be  performed,  and  ARGUMEIjT/PATHS  which 
specify,  for  each  argument  of  the  FUNCTION,  v;hat  links  to 
follow  (via  GET:)  from  the  present  node  to  met  the  desired 
values.  As  each  method  is  applied,  it  builds  a 
GEMERATii]-able  trace  of  its  computation  tree  such  as  that 
shown  in  Figure  7.  (The  actual  printinr  of  this  tree  is  not 
yet  inplenented , ) The  tree  enables  the  system,  after 
estimating  a cost,  for  example,  to  answer  the  question  "Hov; 
did  you  t^et  that?"  It  also  can  set  monitors  on  Questions 
about  trivially  different  computations,  e.r,,  "What  if  the 
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(GET;  TRIP  'COST) 


(GET:  TRIP  'ROUND/TRIP/FARE)  + ((GET;  TRIP  * PER/DIEM)  » (GET;  TRIP  'lyPAYS)) 


(DIFFERENCE/ IN/TIME  (GET:  TRIP  • END/TIME) 

(GET:  TRIP  'BEGIN/TIME)) 


Figure  7.  A trace  of  the  inference  process 
obtaining  the  cost  of  a trip. 
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per  diem  were  twenty  dollars?"  "What  if  it  were  for  six 
days?" 

3.  Coordinatinr:  Execution  with  the  Discourse  Hodel 

The  formal  command  lan,0:ua,n;e  has  been  desi.ffned  to  permit 
a fairly  direct  mappinp  of  an  input  English  sentence  into 
its  underlying  concepts  without  repard  for  how  information 
is  actually  stored.  Thus  we  have  (TRAVELER 
(FIND*  PERSON  (FIRSTNAflE  'BILL)))  and  (GET:  <trip>  'TI[1E) 
even  thoui^h  discourse  context  must  be  used  to  pick  which 
"BILL"  is  meant,  trips  have  their  times  associated  with 
their  individual  lers. 

We  have  found  the  notion  of  a demand  queue  model  useful 
in  accounting  for  discourse  reference.  It  also  helps  to 
explain  how  one  computation  of  a response  can  be  pushed 
down,  while  a whole  dialopue  takes  place  to  obtain  nissinp 
information,  and  how  a computation  can  spawn  subsequent 
expectations  or  di^^ressions , Some  elements  of  this  demand 
model  are  explained  beloi;: 

(a)  Demands : These  are  demands  for  service  of  some  sort 
made  upon  the  system  bv  the  user  or  by  the  system  itself. 
An  active  unanswered  Question  is  a typical  demand  with  hiph 
priority.  The  fact  that  some  questions  cannot  be  answered 
without  more  information  leads  to  the 
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user-makes -query 
system-asks-quest ion 
user-clarif ies 
system-answers-query 

kind  of  embedding  we  have  been  calling  a "mode  of 

interaction,"  Demands  of  lower  priority  include  such  things 
as  a notice  by  the  system  that  the  manager  is  over  his 
budget.  Such  a notice  might  not  be  communicated  until  after 
direct  questions  had  been  answered, 

(b)  Counter-demands : These  are  questions  the  system  has 

explicitly  or  implicitly  asked  the  user.  While  it  should 
not  hold  on  to  these  as  long  as  it  does  to  demands,  nor 

expect  too  strongly  that  they  will  be  met,  the  system  can 
reasonably  expect  that  most  counter-demands  will  be  resolved 
in  some  way.  This  is  an  additional  influence  on  the 
discourse  structure, 

(c)  Current  topic ; This  is  the  active  focus  of 

attention  in  the  dialogue.  It  could  be  the  actual  budret,  a 
hypothetical  budget,  a particular  trip,  or  a conference. 
The  current  topic  is  used  as  an  anc  or  point  for  resolvinc 
references  and  deciding  how  much  detail  to  fzive  in 

responses.  Again,  this  structure  leads  to  certain  modes  of 
interaction.  For  example,  if  the  manarer  says  "Enter  a 
trip,"  the  system  notes  that  the  current  topic  has  changed 
to  an  incompletely  described  tri.  This  results  in  demands 
that  cause  standard  fill-in  questions  to  be  asked.  If  the 
manager  wants  to  complete  the  trip  description  later,  then 

-34- 


BBN  Report  I'io.  ?11S 


Bolt  Beranek  and  Newrnan  Inc 


the  completion  of  the  trip  description  becomes  a low 
priority  demand, 

(d)  lliscel laneous  deictic  st  ru-'tures ; The  discourse 
area  of  the  data  base  also  contains  an  assortment  of  jter'3 
stronp;ly  linked  to  the  here  and  now,  includinr: 

a)  HOW,  a pointer  to  the  current  tine  and  date, 

b)  SrSAKER,  a pointer  to  the  cur'  .-nt  speaker, 

0)  the  last  mentioned  person,  pl''ce,  time,  trip, 
bud'ct,  conference,  etc. 

We  are  desip'ninp'  a preliminary,  one-queue 
implementation  of  this  "demand  model,"  This  queue  v/ill 
consist  ot  fmrris  such  as  (DO  (FOR:  — ) --)  which  represent 

the  sneaker's  nrevious  oueries  and  commands  as  well  as 
commands  initiated  hv  the  system  to  examine  the  consenuences 
of  its  acticiis,  <’ive  information  •‘o  the  user,  or  check  for 
data  base  consistency,  Thesa  forms  are  . ^lated  by 
functional  dependencies  and  relative  prioritic;S,  At  th^ 
Dresent  time,  there  are  only  a few  demand  types:  DO  means 

execute  the  specified  command,  TEST  means  evaluate  the  form 
to  'ilL  or  non-iJIL  and  answer  "no"  or  "yes"  accordinf^ly , 
RESPO'JD  means  mive  the  user  some  information  (which  nay  or 
may  not  be  nart  of  an  ansv/er  to  a direct  query),  PREVSllT 
’"cans  monitor  for  a subsequent  possxble  action  block  its 

normal  execution  (as  in  "Do  not  allow  more  tlian  three  trios 
to  rbironc,"). 
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C*  Control  Primitive^ 

Bonnie  Nash-Webber 

To  aid  us  in  devising  and  simulating  possible  control 
strategies,  we  have  isolated  into  separate  user-callable 
functions  those  operations  which  our  current  set  of  data 
structures  suggest  to  be  primitive,  plus  other  functions 
that  have  seemed  useful.  The  current  set  of  such  functions 
is  undoubtedly  incomplete.  New  data  structures  and  other 
ways  of  relating  instances  of  current  ones  to  each  other 
will  most  likely  lead  to  new  control  primitives.  We  have 
been  using  the  following  set  of  primitives  in  the 
incremental  simulations  of  the  speech  system  run  in  the  last 
quarter.  (Excerpts  from  one  such  incremental  simulation 
session  foli(.'W  this  section.) 

1.  A fr.iiction  for  reading  in  a new  utterance:  SENTENCE! 

2.  A function  for  creating  a lattice  of  the  highest 

lexically  scoring  wordmatches:  SCAN 

3.  A function  for  making  a theory  of  a set  of  wordmatches: 

MKTHRf 

4.  h function  for  refining  a theory  with  a new  wordmatch: 

REFTHRY 


5.  Functions  for  evaluating  a theory:  SEMVAL,  SYNVAL, 

PRAGVAL 

6.  Functions  for  constructing  user-made  proposals: 

WORDPSL,  CATPSL,  BOTHPSL 

7.  Functions  for  doing  the  proposals  (i.e.,  sending  them 

down  to  the  lexical  retrieval  fork):  DOPSL,  DOPSLS 

8.  A function  for  removing  a proposal  from  the  proposals 

list  without  having  done  it:  REMOVEPSL 
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9.  Functions  for  doinp,  the  events  made  by  some  component: 
DOEVT,  DOEVTS,  DOWORDEVTS 

10.  A function  for  printing  any  of  the  control  data 

structures:  P 

11.  A function  for  doing  a more  thorough  scan  on  a region: 

SCANREGIOFI 

12.  A function  for  ascertaining  the  existence  of  better 

matches  of  a particular  word,  given  one  match  for  that 
V7ord  has  already  been  found:  BETTERFIATCH? 

13.  A function  for  creatinp’  fuzzy  wordmatches:  FUZZ? 

1A.  '^unctions  for  talking  to  the  various  forks  directly: 
MATCHCONTROL,  SYNCONTROL 

Notice  several  thirgs  about  the  abo^/c  proups  of 
functions.  First,  v/e  have  kept  separate  the  notion  of 

creating  a oroposal  from  that  of  actually  doing  it.  This 
allov/s  oroposals  to  be  queued  and  selected  later.  Secondly, 
proposals  can  now  be  made  by  either  a SPSECrILIS  comoonent  or 
the  user.  This  allows  the  user  to  make  proposals,  while 
postponing  the  decision  about  which  component  should  have 
had  the  smarts  to  do  so  itself.  Thirdly,  v/e  have  tried  to 
be  somewhat  consistent  in  naming  conventions,  anything 

with  ?SL  in  its  name  refers  to  proposals,  THRY  to  theories, 
ZVT  to  events,  VAL  to  evaluation.  Finally,  evaluation  of  a 
theory  by  a component  may  involve  that  component's  making  a 
hypothesis  about  how  the  words  fit  together,  as  well  as 
comparing  that  hypothesis  against  information  already  in  the 
theory.  For  example,  if  Syntax  goes  first,  the  consistency 
of  Syntax  and  Semantics  is  part  of  the  SEFIVAL  evaluation, 
while  if  Semantics  evaluates  the  theory  first,  the 
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consistency  check  is  part  of  SYNVAL. 


Functions 


1.  Reading  in  a sentence. 

[SENTENCE!  <utterance>  <suffix>] 

An  NLAMBDA  which  sets  up  the  lexical  retrieval  fork  on 
the  given  utterance  and  also  upper  level  internal 
structures  which  depend  on  the  utterance.  <suffix> 
refers  to  the  suffix  on  the  segment  label  file  for  the 
utterance,  which  is  a code  for  the  type  of  segment 
labelling.  Every  time  SENTENCE!  is  called,  the  lexical 
component  is  set  up  anew. 

e.g.  [SENTENCE!  JJW110  C] 


2.  Creacing  a lattice  of  good  words, 

[SCAN] 

A function  of  no  args  which  requests  the  lexica . 
retrieval  fork  to  find  the  n best  wordmatches  in  the 
given  utterance  (currently,  n=15),  which  it  then  puts 
into  the  word  lattice,  without  doing  anything  else  to 
them.  [They  are  no  longer  autoi.iatically  sent  to 
Semantics  for  evaluation,  as  they  had  been  in  the 
original  SfEECHLIS  control  strategy,] 


3.  Making  a theory, 

[MKTHRY  <args>] 

MKTHRY  is  an  NLAMBDA  nospread  which  can  take  any  number 
of  ar' umerts,  tach  argument  is  a wordmatch  handle, 
i.e,  'jither  a number,  corresponding  to  a wordmatch 
index,  or  a function  which  evaluates  to  a wordmatch, 
either  simple  or  fuizy.  See  13«  for  a description  of 
FUZZ?,  which  will  create  a fuzzy  wordmatch  around  a 
given  simple  wordmaten,  if  "like"  matches  exist, 

MKTHRY  creates  a theory  data  structure  and  records 
it  on  THEORYTBL,  It  also  calls  for  a lexical 
evaluation  of  the  theory,  which  may  result  in  the 
spawning  of  son  theories  whose  fuzzy  wordmatches  have 
been  reduced  or  even  replaced  by  simple  word  matches. 
Other  componential  evaluations  (i.e.  Syntactic, 
Semantic  and  Pragmatic)  can  be  called  for  separately. 
(See  5.) 
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e.g.  [tIKTHRY  11  3 (FUZZ?  14)  18] 


4,  Refining  a theory  (user-called), 

[REFTHRY  <theory  number>  <wordmatch  handle>] 

REFTHRY  is  an  NLAMBDA  which  takes  a theory  number  (e,g, 
1,2,3,,.,)  and  a wordmatch  handle,  i,e,  either  a 
number,  corresponding  to  a wordmatch  index,  or  a 
function  which  evaluates  to  a wordmatch.  either  sir. pie 
or  fuzzy.  Its  output  ’s  a new  theory,  a son  of  the 
original  one,  cc-  Gaining  the  augmented  list  of 
wordmatches.  It  is,  in  a sense,  acting  like  a 
user-created  event.  Like  MKTHRY,  lexical  evaluation  is 
also  done  on  the  theory,  which  again  may  result  in  the 
spawning  of  refined  son  theories, 

e,g,  [REFTHRY  5 6]  or  [REFTHRY  5 IFHZZ?  14)] 


5,  Evaluating  a theory, 

[SEMVAL  <theory  nunber>] 

[SYNVAL  <theory  number>] 

[PRAGVAL  <theorv  number>j 

Each  of  these  functions  may  be  called  with  either  a 
theory  number  (an  integer)  as  argument  or  no  argument 
at  all.  In  the  latter  case,  it  is  assumed  that  an 
evaluation  of  the  last  theory  created  (LASTHEORY)  is 
desired.  Each  of  these  functions  does  one  specific 
kind  of  evaluation:  semantic,  syntactic,  ci  pragmatic. 

Semantic  evaluation  of  a theory  is  performed  by 
SEMVAL,  It  is  assumed  that  the  tlieory  has  not 
previously  been  seen  by  Semantics,  which  tries  to  both 
construct  one  or  more  consistent  semantic  hypotheses 
for  the  set  of  wordmatches  contained  in  the  theory  and 
evaluate  those  hypotheses,  l/hen  SEMVAL  is  given  a 
theory  containing  more  than  one  wordmatch,  it  is  as  if 
Semantics  had  taken  over  control  from  the  Control 
component.  That  is,  local  monitors  are  set  and  local 
events  proce.ssed  as  each  word  in  the  theory  is 
considered,  until  either  a set  of  consistent  hypotheses 
is  established  for  the  entire  wordmatch  set  or  no  local 
events  remain  to  be  processed.  After  the  theory  is 
processed,  \/hat  remains  are  external  monitors  for  other 
concepts  which  could  be  of  use  to  the  theory,  SEMVAL 
is  not  fully  v/orked  out  for  multi-word  theories  yet. 
That  is,  it  is  not  clear  whetner  the  local  monitors 
should  disappear  after  processing  or  whether-  they 
should  remain  to  reduce  Semantics'’  load  when  riven 
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another  theory  to  evaluate,  containing  some  of  the  same 
wordmatches.  If  they  remain,  much  care  will  have  to  be 
taken  to  avoid  making  inappropriate  associations. 

Syntacti  ■ evaluation  (SYNVAL)  involves 
constructing  partial  parses  for  the  set  of  wordmatches 
in  the  theory.  The  syntactic  part  of  a theory  is 
currently  kept  down  in  the  lower  fork  housing  syntax, 
so  the  only  obvious  effect  of  SYNVAL  on  a theory  data 
structure  is  the  replacement  of  its  syntactic  score 
with  the  value  returned  from  SYNVAL. 

With  respect  to  pragmatic  evaluation  of  a theory, 
it  is  not  currently  clear  whether  Pragmatics  will  play 
a separate  role  in  evaluating  a theory  which  does  not 
span  the  entire  utterance.  In  any  case,  when 
implemented,  PRAGVAL  will  call  for  the  pragmatic 
evaluation  of  a theory. 

e.g.  (SYNVAL  7)  or  (SEMVAL) 

We  have  not  made  the  lexical  evaluation  of  a 
theory  a user  callable  control  primitive.  Since 
lexical  evaluation  depends  only  on  wordmatch  scores,  no 
real  "knowledge  s-u;'(.e"  needs  be  called  upon  to  compute 
it.  The  lexical  noore  for  a theory  is  currently  set  to 
the  sum  of  the  scores  of  theory  wordmatches.  For  a 
fuzzy  wordmatch,  the  score  of  its  best  member  wordmatch 
is  taken  as  the  score  for  the  whole  fuzzy. 

When  a theory  contains  fuzzy  wordmatches,  its 
lexical  evaluation  may  result  in  its  abandonment  in 
favor  of  sons  spawned  during  the  evaluation  process. 
These  sons  differ  from  their  father  in  having  more 
clearly  defined  fuzzy  wordmatches,  or  no  fuzzies  at 
all.  The  reason  for  so  refining  a theory  is  that,  to 
Syntax,  each  of  the  sons  will  now  have  a clearly 
defined  character,  which  their  father  lacks  because  it 
is  "too  fuzzy".  Refinements  are  created  when  the 
following  situation  arises:  the  best  wordmatch  in  a 
fuzz,y  is  separated  from  its  neighbor  to  the  left  or 
right  by  a one  segment  gap.  If  by  considering  some 
other  match  in  the  fuzzy,  this  gap  could  be  eliminated, 
two  new  son  theories  are  created:  one  which  contains 
the  gap  and  one  which  doesn't.  The  resulting  theories 
are  quite  different  to  Syntax,  since  adjacency  is  its 
strongest  constraint  on  how  a set  of  wordmatches  can  be 
parsed. 
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6,  Making  proposals. 

[WORDPSL  <wordlist>  <direction> 

<boundary  or  oontext>] 

[CATPSL  <catepory  list>  <direction> 

<boundary  or  context>] 

[BOTHPSL  ( <wordlistXcatePory  list>) 

sdirection>  <boundary  or  context>] 

Proposals  can  ar*se  automatically  during  the  evaluation 
of  a theory  by  one  of  the  components.  In  addition, 
there  are  three  control  primitives  which  allow  the  user 
to  make  proposals  too.  Note  that  "making"  a proposal 
is  different  from  actually  "being"  it,  i.e.,  sending  it 
down  to  the  lexical  retrieval  fork  for  execution. 
"Making"  a proposal  just  puts  it  on  the  appropriate 
proposal  queue.  Queuing  proposals  this  way  allows  for 
merging  similar  ones,  i.e.  ones  with  similar  direction 
and  intersecting  contexts,  and  also  for  deciding  which 
ones  to  do  when. 

Here  <wordlist>  is  a list  of  v/ords  like  (GO  TRAVEL 
VISIT),  <category  list>  is  a list  of  word  classes  like 
(AQX  V ADV).  the  PSL  functions  all  make  aopropriate 
checks  that  each  member  of  <v/ordlist>  is  in  the 
dictionary  and  each  member  of  <catei^ory  list>  is  a 
valid  word  class,  as  supported  by  the  lexical  retrieval 
component.  The  value  of  CATEGORIES  is  the  current  list 
of  valid  categories.  <direction>  is  either  RIGHT/OF, 
LEFT/OF  or  BETWEEN,  indicating  the  words  or  categories 
should  be  searched  for  to  the  left,  right  or  between 
the  given  segment  boundaries  or  wordmatches.  <boundary 
or  context>  then  is  either  a single  number,  indicating 
a segment  boundary,  a dotted  pair,  indicating  two 
segment  boundaries  (used  with  BETWEEN);  a list  of 
wordmatch  indices;  or  a double  list  of  wordmatch 
indices  (again,  used  with  BETWEEN,  for  left  and  right 
context).  Either  of  these  latter  two  options  may  be 
prefaced  by  "Wll",  if  the  user  wants  to  make  sure  he 
does  not  make  a mistake  and  type  a boundary  number  v/hen 
he  means  a wordmatch  and  vice  versa,  ilatches  resulting 
from  proposals,  either  user  or  component  made,  will  be 
anchored  at  the  bounoary  or  context. 

e.g.  (WORDPSL  (ANYWHERE)  .RIGHT/OF  (2  n)) 

(CATPSL  (ADJ  QUANT  ART)  LEFT/OF  7] 

(CATPSL  (PREP)  BETWEEN  ((7  11)(5))) 

(BOTHPSL  ((IJCAI)(ADJ  ADV))  RIGHT/OF  (W[l  17)) 
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7.  Doing  the  proposals. 

[DOPSLS] 

[DOPSL  <args>] 

Doing  proposals  involves  sending  them  down  to  the 
lexical  retrieval  fork  for  execution.  The  first 
function,  DOPSLS,  does  all  the  proposals  currently 
waiting  to  be  done.  The  second  function  DOPSL  can  be 
called  with  a list  of  numbers,  corresponding  to  the 
numbers  of  those  proposals  the  user  wishes  to  have 
done,  or  no  arguments  at  all,  indicating  the  user 
wishes  to  have  the  last  proposal  he  made  done.  One 
gets  the  numbers  associated  with  proposals  oy  printing 
them  out  with  (P  PROPOSALS) i 

e.g.  (DOPSLS)  5r  (DOPSL  3 1 ^)  or  (DOPSL) 


8.  Removing  a proposal. 

[REMOVEPSL  <n>] 

In  debugging,  one  may  find  that  an  incorrectly 
formatted  proposal  has  gotten  on  the  list  of  proposals. 
To  avoid  the  chance  of  sending  it  down  to  the  lexical 
retrieval  fork,  one  can  use  the  function  REMOVEPSL  to 
remove  it.  Its  argument  is  the  number  of  the  proposal 
one  wishes  to  have  removed.  Again,  one  gets  the  number 
by  printing  out  the  proposals  with  (P  PROPOSALS). 


9.  Doing  events. 

[DOEVTS] 

[DOEVT  <args>] 

[DOWORDEVTS] 

These  functions  allow  the  user  to  select  a specific  set 
or  type  of  component  generated  events  to  have  done 
(DOEVT,  DOWORDEVTS)  or  to  do  them  all  (DOEVTS).  (There 
is  currently  no  simple  way  for  the  user  to  create  his 
own  events  and  then  have  them  done.)  DOEVT  takes  as  its 
input  a list  of  event  numbers  which  can  be  gotten  by 
printing  out  the  eventqueue  with  (P  EVENTS).  The 
corresponding  events  are  then  removed  from  the 
eventqueue  and  executed  in  the  specified  order. 
DOWORDEVTS  calls  for  the  processing  of  "word"  events 
created  by  Semantics,  which  result  in  the  construction 
of  "multi-word  names"  like  "registration  lee"  and 
"travel  budget".  This  special  function  exists  because 
of  wanting  to  do  these  "word"  events  befo’e  any  other 
ones. 
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e.g.  (DOEVTS)  or  (DOEVT  3^7) 


10,  Printing  a control  data  structure, 
[P  <data  structure>] 


P is  an  NLAMBDA  which  takes  as  its  argument  the  name  of 
a data  structure,  which  it  then  prints  in  a form  easy  to 
read  and  understand.  Currently,  the  following  data 
structure  names  are  acceptable  arguments  to  P: 

LATTICE  - pr*ints  out  the  word  lattice 
EVENTS  - prints  out  the  eventqueue  in  an 

easily  readable,  though  sketchy,  way 
EVENT  <n>  - printsout  event  <n>  in  full  detail 
PROPOSALS,  PSLS  - prints  out  the  extant  proposals 
THEORIES  - prints  out  the  theories 
THEORY  <n>  - prints  out  theory  n 
MATCHES  <word>  - prints  out  all  wordmatches 

for  <word> 

WLATMON  <bdry>  - prints  out  the  word  lattice 

monitors  either  starting 
or  ending  at  <bdry> 

CFT  <n>  - prints  caseframe  token  <n> 

V/ORDSTARTS  <n>  - prints  the  list  of  wordmatches 

v;hose  left  boundary  is  <n> 

WORDENDS  <n>  - prints  the  1st  of  wordmatches 

whose  right  boundary  is  <n> 

e.v.  (P  THEORY  2)  or  (P  LATTICE) 
or  (P  HATCHES  TRIP) 


Scanning  a region, 

[SCANREGION  <direction>  <boundary  or  context>] 

SCANREGION  allows  one  to  search  a specific  recrion  of 
the  utterance,  for  example,  the  beginning  of  the 
utterance  or  a rerion  where  no  nice  words  have  been 
found,  <direction>,  as  in  the  PSL  functions,  can  be 
either  LEFT/OF,  RIGHT/OF  or  BETWEEN,  <boundary  or 
context>  has  the  same  form  as  that  used  in  the  proposal 
functions  (See  6,),  If  <direction>  is  BETV/EEN  and  the 
second  argument  is  a dotted  pair  of  boundaries,  the 
scan  is  done  sliding.  Otherwise  the  scan  is  anchored 
at  the  appropriate  side  of  the  wordmatches  or  the 
appropriate  boundary.  Like  SCAN,  SCANREGION  currently 
returns  the  15  best  matches  in  the  region.  Those 
matches  which  are  not  above  SCANTHHESHOLD  (currently 
set  to  100)  are  put  on  a list  of  REJECTS,  which  the 
user  IS  shown  and  ask  to  dispose  of.  Each  ot:e  may  oc 
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put  into  the  wordlattice  or  forgotten  about. 

e.g.  (SCANREGION  BETWEEN  (7  . 10)) 

(SCANREGION  LEFT/OF  (2  9)) 
(SCANREGION  RIGHT/OF  (WM  6 U)) 


12.  Ascertaining  better  wordmatches. 

[BETTERMATCH?  <n>] 

A long  word  which  matches  well  may  be  found  to  match 
better  if  the  original  constraints  on  its  left  and/or 
right  boundary  are  lifted.  BETTERMATCH?  takes  a 
wordmatch  index  and  looks  freely  around  the  utterance 
for  overlapping  matches  of  that  word  better  than  the 
given  one. 


13.  Creating  a fuzzy  wordmatch. 

[FUZZ?  <n>] 

FUZZ?  takes  a wordmatch  index  <n>  as  argument  and  looks 
in  the  word  lattice  for  other  matches  of  the  same  word 
which  are  fuzzily  close  to  the  given  match.  If  there 
are,  it  combines  them  into  a fuzzy  wordmatch  in 
descending  order  of  quality.  This  is  returned  as  the 
value  of  FUZZ?.  If  no  close  matches  exist,  the 

wordmatch  corresponding  to  <n>  is  returned.  (Note  a 
fu'izy  wordmatch  is  represented  as  a list  of  simple 
-wordmatches. ) 

e.g.  (FUZZ?  3) 


14.  Accessing  forks  directly. 

[MATCHCONTROL] 

[SYNCONTROL] 

There  are  debugging  situations  in  which  one  wants  to 
root  around  in  one  of  the  lower  forks  to  find  the  cause 
of  an  error.  MATCHCONTROL  will  put  the  user  in  direct 
contact  with  the  lexical  retr-'eval  fork,  and 
SYNCONTROL,  to  the  lower  LISP  fork  housing  Syntax.  To 
exit  from  the  former,  the  user  should  type  Q<cr>;  from 
the  latter,  0K<cr>. 


-45- 


BBN  Report  No,  3115 


Bolt  Beranek  and  Newman  Inc 


Incremental  Simulation-  an  example: 

The  control  strategy  we  are  simulatirc  in  the 
incremental  simulation,  excerpts  from  which  follov;  this 
introduction,  relies  initially  on  the  best  matchinn;  words 
the  lexical  retrieve]  component  can  find  on  an  anchored 
left-to-ri^^ht  scan  over  some  region  of  the  utterance, 
starting  from  the  initial  one.  After  each  left-to-right 
scan,  the  best  matching  word  is  given  to  semantics,  who 
notes  the  contexts  in  which  that  word  could  occur,  (If 
several  words  are  tied  in  word-match  for  best  match,  the 
stratepv  is  to  consider  the  likelihood  of  occurrence  of  the 
matched  nronunc iat ion  of  the  given  v;ord.)  Processing  the 
proposals  made  by  a higher-level  component  and  notices  of 
detected  word  coincidences  takes  precedence  over  doing  the 
next  left-to-rimbt  scan,  starting  from  the  right  end  of  the 
last  best  matching  word,  Durin^^  this  process,  multiple 
theories  may  be  created  either  because  of  note^  semantic 
associations  between  the  word  matches  or  jusl  excellence  of 
match  quality.  Whenever  a theory  is  spav/ned  which  has  two 
or  more  adjacent  word  matches,  it  is  sent  to  Syntax  for 
evaluation,  perhaps  resulting  in  further  proposals  or  events 
v/hose  processing,  as  before,  takes  precedence  over 
left-to-right  scan, 

^ final  element  of  this  control  strategy,  another 
deviation  from  strict  left -to-rightness , is  meant  to  get 
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around  a problem  caused  by  anchored  scans.  Requiring  a word 
to  match  starting  and/or  ending  at  some  pre-specified 
position  may  result  in  some  contorting  of  the  match  to  meet 
that  constraint  and,  therefore,  in  a lower  score  for  the 
match.  Giving  the  Matcher  freedom  to  choose  both  ends  will 
allow  it  to  make  the  best  possible  match.  Therefore,  in 
this  strategy,  if  a long  word  (longer  than  six  phonemes) 
matches  well  anchored,  a sliding  scan  is  made  for  it  to  see 
if  it  would  match  better  with  slightly  different  boundaries. 

To  make  reading  this  extract  easier,  note  that  a word 
match  is  printed  across  the  line  as: 

<wordmatch  index>  <word>  <left  boundary> 

<right  boundary>  <match  quality> 

<a  priori  likelihood  of  the  particular 
pronunciation  used  in  the  match> 

<inflection,  or  — if  uninflected> 

Lines  typed  by  the  user  are  preceded  by  a line  number 
followed  by  an  underline. 


35_SENTENCE! ( JJW1 10  C) 

T 

36_(SCANREGI0N  HIGHT/OF  0) 

1 WHAT-R  0 3 193  -31  — 

2 WHAT  0 3 193  0 -- 

3 ONE  0 3 191  0 — 

4 WHEN  031040— 

5 ON  0 3 95  -23  — 

6 WAS-R  0 3 83  -31  — 

7 WAS  0 3 83  0 — 

8 ALL  0 3 79  0 — 

9 WOULD-R  0 3 66  -31  — 

10  './ENT  0 3 46  0 — 

11  ALL02380— 

12  OH  0 2 29  0 — 

13  L.A.  0 4 27  0 — 
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in  ONTO  0 4 25  -93  — 

15  ON-R  0 3 22  -31  — 

NIL 

37_(MKTHRY  (FUZZ?  2)) 

THP.ORYy/l 

NIL 

38_(P  THEORY) 

193  THEORYy/1 

2 WHAT  0 3 193  0 — 

NIL 

NIL 

(NIL  NIL  NIL  NIL  NIL) 

NIL 

NIL 

((193  . 3) 

0 0 0 0) 

(NIL  NIL  NIL) 

NIL 

(*  semantic  evaluation  requested) 

39_SEMVAL] 

SEMANTICS  PROCESSING  THEORY#! 

2 WHAT  0 3 193  0 — 

THEORY#!  WHAT 
AS  [ WHAT  686] 

PUTTING  A CEM  ON  [ BE  684] 

PUTTING  ' CEM  ON  [ CONCEPT  Of  GIVEABLES  720] 
T 

(*  placing  case  event  monitors  on  concepts  in  the 
semantic  network) 

40_(P  THEORY) 

193  THEORY#! 

2 WHAT  0 3 193  0 — 

NIL 

ML 

(686  NIL  (CFT#!) 

((2  CFT#!)) 

(686)) 

NIL 

NIL 

((!93  . 3) 

0 0 0 0) 

(NIL  NIL  NIL) 

NIL 

4!_(P  CFT  1 ) 

(*  print  caseframe  coken) 

CASEFRAME  FOR  CONCEPT  [WHAT  BE  X QUESTIONS  685] 

( ( (REALIZES  . CLAUSE) 

(CONCEPT  . 685)) 

(HEAD  (EQU  . (BE)) 

NIL  OBL) 

(QWORD  (WHAT  . (WHAT)) 

NIL  OBL) 

(PATIENT  (HEM  . [CONCEPT  OF  GIVEABLES  720]) 
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NIL  OBD) 

NIL 

^2  (P  PROPOSALS) 

NIL 

»i3_(P  ’■■■LNTS) 

NIL 

so  CONTINUh  LEFT-TO-RIGHT) 

SO 

45_(SCANREGI0N  right/of  (2)) 

16  IS-R  3 5 72  -31  — 

17  IS  3 5 72  0 — 

18  A 3 4 35  0 — 

19  EIGHTH  3510-- 

20  AI  3 5 1 0 — 

THE  FOLLOWING  MATCHES  WERE  REJECTED  DUE  TO  THEIR  LOW  SCORE: 

21  US  3 5 -1 4 0 — 

22  A-R  3 ^ -25  -16  -- 

23  IF-R  3 5 -66  -31  — 

21]  IF  3 5 -66  0 — 

25  I 3 4 -77  0 — 

26  AM-R  ^ 4 -77  -16  — 

27  EIGHTH  3 6 -78  0 -- 

28  IS-R  3 4 -79  -31  — 

29  IS  3 4 -79  0 — 

30  ON-R  3 4 -84  -16  — 

DO  YOU  WANT  TO  PUT  ANY  OF  THEM  IN  THE  LATTICE?  TYPE  EITHER  N 
OR  A LIST 

OF  THOSE  YOU  WISH  TO  ENTER 
N 

NIL 

46„(MKTHRY  (FUZZ?  17)) 

THEORY #2 
NIL 

47_(P  THEORY) 

72  THE0RY#2 

17  IS  3 5 72  0 — 

NIL 

NIL 

(NIL  NIL  NIL  NIL  ’XL) 

NIL 

NIL 

(^72  . 2) 

0 0 0 0) 

(MIL  NIL  NIL; 

NIL 

48_SEMVAL] 

SEMAMIICS  PROCES  NG  THE0RY«2 
17  IS  3 5 72  0 -- 
THEORYf/2  IS 
AS  [ BE  684] 

PUTTING  A CEH  ON  [ WHAT  686] 

PUTTING  A CEM  ON  [ CONCEPT  OF  GIVEAPLES  720] 

AS  [ WHAT  PE  X QUESTIONS  685] 
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NOTICING  EVENT  LINKING  THE0RY#1  TO  THE0RY#2 
SCORE  = 315 
T 

149_(P  EVENTS,'' 

1 315  SEK 

THE0RY#1  WHAT 
THEGRY#2  IS 


NIL 

50_(P  PROPOSALS) 

NIL 

5i_(»  SO  DO  THE  EVENTS) 

SO 

52_DOEVTS] 

SEMANTICS  PROCESSING  EVENT  JOINING  THEORY#1 


2 WHAT  0 3 193  0 — 

TO  THE0RY#2 
17  IS  3 5 72  0 -- 
CREATING  THEORY#3 

PUTTING  A CEM  ON  [ CONCEPT  OF  GIVEABLES  720] 
AS  [ WHAT  BE  X QUESTIONS  685] 

NIL 


53_(P  THEORY  3) 

270  THEORY//3 

2 WHAT  0 3 1930-- 
17  IS  3 5 72  0 -- 
(THEORY#1  THE0RY#2) 

NIL 

(NIL  NIL  (CFT//3) 

((2  CFT#3) 

(17  CFT#3)) 

(685)) 

NIL 

NIL 

((265  . 5) 

10  0 0 0) 

(NIL  NIL  MIL) 

NIL 


54_(P  CFT  3) 

CASEFRAME  FOR  CONCEPT  [WHAT  BE  X QUESTIONS  685] 
( ( (CFTISA  685) 

(SONOF  CFT/n) 

(REALIZES  . CLAUSE) 

(CONCEPT  . 685)) 

(HEAD  (IS  . (EE)) 

NIL  OBL) 

(QWORD  (WHAT  . (WHAT)) 

NIL  OBL) 

(PATIENT  (HEM  . [CONCEPT  OF  GIVEABLES  720]) 

NIL  OBL)) 


» « « « « 
and  so  on,.. 
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